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DETAILED ACTION 

Claim Rejections - 35 USC §102 

1. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 

2. Claims 1, 3-16, 18-21, 23-36, 38-47 are rejected under 35 U.S.C. 102(b) as being 
anticipated by Ellozy et al. (5,649,060). 

As to claim 1, Ellozy et al. teach 

A system for cataloguing electronic information, comprising: 

an electronic device that captures audio/video data corresponding to a photographic 

target, said audio/video data including a narration provided by a narrator (Fig. 2 col. 4 lines 52- 

56); 

a speech recognition engine that automatically performs a speech recognition process 
upon said narration to generate labels that correspond to respective subject matter locations in 
said audio/video data (col. 8 lines 3-10); and 

a label manager that manages a label mode for generating and storing said labels, said 
label manager also controlling a label search mode for utilizing said labels to locate said 
respective subject matter locations in said audio/video data(col. 8 lines 3-10; col. 3 lines 54-59; 
and col. 6 lines 61-65; the mapping module works as a label manager that stores, via a text 
storage device, and generates labels regarding audio/video). 
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Claim 21 is directed toward a method to implement or execute the system of claim 1, and 
is similar in scope and content of claim 1, therefore, claim 21 is rejected under similar rationale. 

As to claims 3 and 23, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said speech recognition engine is configured in a simplified configuration that 
efficiently compares said narration with acoustic models to identify phone strings that represent 
said narration, said speech recognition engine referencing a compact dictionary to look up 
recognized vocabulary words that correspond to said phone strings, said speech recognition 
engine utilizing a limited set of recognition grammar to form said recognized vocabulary words 
into said labels that are supported by said speech recognition engine (col. 7 lines 20-30 and col. 3 
lines 44-50). 

As to claims 4 and 24, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said label manager initially instructs said electronic device to enter a real-time 
label mode for creating and storing said labels, said electronic device concurrently capturing said 
audio/video data and said narration after said label manager instructs said electronic device to 
enter said real-time label mode (col. 6 lines 66-67 and col. 7 lines 1-5). 

As to claims 5 and 25, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said electronic device enters a real-time label mode in response to a verbal label- 
mode command from a system user, said verbal label-mode command being recognized and 
provided to said label manager by said speech recognition engine (col. 3 lines 44-50; col. 8 lines 
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3-10; and col. 6 lines 61-65). 

As to claim 6 and 26, which depends on claims 1 and 21, Ellozy et al. teach 
wherein said speech recognition engine automatically generates said labels as said 
electronic device captures said audio/video data and said narration (col. 8 lines 3-10). 



As to claims 7 and 27, which depend on claims 1 and 21, Ellozy et al. teach 
wherein a post processor performs a post-processing procedure upon said labels in a real- 
time label mode, said post-processing procedure including a validation procedure using one or 
more confidence measures to eliminate invalid labels that fail to satisfy pre-determined 
validation criteria (col. 7 lines 53-63). 

As to claims 8 and 28, which depend on claims 1 and 21, Ellozy et al. teach 

wherein said label manager stores said labels during a real-time label mode, said labels 

being stored along with meta-information that associates each of said respective subject. matter 

locations to a corresponding one of said labels (col. 8 lines 3-10). 

As to claims 9 and 29, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said electronic device initially captures said audio/video data and said narration 
prior to entering said label mode (col. 8 lines 3-10). 
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As to claims 10 and 30, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said label manager instructs said electronic device to enter a non-real-time label 
mode for creating and storing said labels, said electronic device responsively retrieving and 
playing back said audio/video data and said narration (col. 7 lines 50-55 and col. 8 lines 3-5 and 
7-10). 

As to claims 1 1 and 31, which depend on claims 1 and 21, Ellozy et al teach 

wherein said speech recognition engine automatically generates said labels by analyzing 

said audio/video data and said narration as said electronic device plays back said audio/video 

data and said narration (col. 4 lines 19-28 and col. 5 lines 1-2). 

As to claims 12 and 32, which depend on claims 1 and 21, Ellozy et al. teach 
wherein a post processor performs a post-processing procedure upon said labels in a non- 
real-time label mode, said post-processing procedure including a validation procedure using one 
or more confidence measures to eliminate invalid labels that fail to satisfy pre-determined 
validation criteria (col. 7 lines 53-60). 

As to claims 13 and 33, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said label manager coordinates a label validation procedure for validating said 
labels, said label manager generating a validation graphical user interface upon a display of said 
electronic device for a system user to interactively evaluate, delete, and edit said labels (col. 4 
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lines 19-28). 

As to claims 14 and 34, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said label manager coordinates a label validation procedure for validating said 
labels in response to verbal validation commands from a system user, said verbal validation 
commands being recognized and provided to said label manager by said speech recognition 
engine (col. 3 lines 50-53 and col. 4 lines 28-31 and 61-67 and col. 7 lines 51-55; validation is 
done via alignment, time stamping and confidence scoring). 

As to claims 15 and 35, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said label manager stores said labels in a non-real-time label mode, said labels 

being stored along with meta-information that associates each of said respective subject matter 

locations to a corresponding one of said labels (col. 4 lines 9-19). 

As to claims 16 and 36, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said label manager instructs said electronic device to enter said label search 
mode during which a system user interactively selects a search label for performing a label 
search procedure to locate a specific one of said respective subject matter locations 
corresponding to said search label (col. 4 lines 19-27; col. 3 lines 51-53 and col. 7 lines 50-59 ). 

As to claims 18 and 38, which depend on claims 1 and 21, Ellozy et al. teach 
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wherein a system user selects a search label by issuing a verbal search-label command, 
said verbal search-label command being recognized and provided to said label manager by said 
speech recognition engine (col. 3 lines 51-53 and col. 7 lines 50-59). 

As to claims 19 and 39, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said label manager instructs said electronic device to automatically locate and 
retrieve a specific one of said respective subject matter locations in response to a system user 
selecting a search label (col. 4 lines 65-67 and col. 5 lines 1-2; automatic location is done via the 
time stamping of the transcripts). 

As to claims 20 and 30, which depend on claims 1 and 21, Ellozy et al. teach 
wherein said electronic device automatically plays back a specific retrieved one of said 

respective subject matter locations from said audio/video data for viewing by said system user 

(col. 4 lines 65-67 and col. 5 lines 1-2). 

Claim 41 is directed toward a computer readable medium to implement or execute the 
system of claim 1, and is similar in scope and content of claim 1, therefore, claim 41 is rejected 
under similar rationale. 

Claims 42-47 are directed toward a system to implement or execute the method of claim 
21, and is similar in scope and content of claim 21, therefore, claims 42-47 is rejected under 
similar rationale. 
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Claim Rejections - 35 USC §103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

4. Claims 2, 17, 22 and 37 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Ellozy et al. (5,649,060) in view of Durlach (6,807,367). 

As to claims 2 and 22, which depend on claims 1 and 21, Ellozy et al. teach electronic 
device (col. 3 lines 60-61, text storage device, which can be stored on a tape or disk). 

Ellozy et al. do not explicitly teach wherein said electronic device is implemented as an 
audio/video camcorder device. 

However, Durlach does teach wherein said electronic device is implemented as an 
audio/video camcorder device (col. 1 lines 43-55). 

Therefore, it would have been obvious to one of ordinary skill in the art at the time of the 
invention to implement the indexing of audio and text using speech recognition stored on a text 
storage device into the camcorder of Durlach, because Durlach teaches that this would provide 
consumers general-purpose computer animation packages, in which creating a new movie image 
content is relatively inexpensive and straightforward even for the individual consumer (Durlach, 
col. 1 lines 43-55). 

As to claims 17 and 37, which depend on claims 1 and 21, Ellozy et al. teach wherein 
said label manager generates a label-search on an electronic device, a system user viewing said 
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labels and corresponding representative images from said audio/video data for selecting a search 
label (col. 3 lines 51-53; col. 3 lines 53-59 and col. 7 lines 50-59). 
Ellozy et al. do not explicitly teach display. 

However, Durlach does teach a GUI on a display (col. 1 lines 12-17). 

Therefore, it would have been obvious to one of ordinary skill in the art at the time of the 
invention to implement the indexing of audio and text using speech recognition stored on a text 
storage device into a display of Durlach, because Durlach teaches that this would provide a 
movie display system to interactively implement and provide for dynamic selection of displayed 
movie segments and control of the temporal presentation rate of these segments, col. 1 lines 12- 
17. 

Conclusion 

1 . The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure, see PTO-892. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Myriam Pierre whose telephone number is 571-272-761 1 . The 
examiner can normally be reached on 8:30-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on 571-272-7602. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (BBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

MP 6/26/06 




/ RICHEMOND DORVIL 
SUPERVISORY PATENT EXAMINER 



